Extracting information networks from the blogosphere
Similar resources
Extracting Information Networks from the Blogosphere: State-of-the-Art and Challenges
We study the problem of automatically extracting relational networks of recognizable entities from the blogosphere. We describe a highly parallel and efficient system capable of processing millions of blog posts in a few hours, and experiments based on state-of-the-art techniques for the extraction and identification of entities and relations. From these, we devise a tuned approach that achieve...
Extracting Meta Statements from the Blogosphere
Information extraction systems have been recently proposed for organizing and exploring content in large online text corpora as information networks. In such networks, the nodes are named entities (e.g., people, organizations) while the edges correspond to statements indicating relations among such entities. To date, such systems extract rather primitive networks, capturing only those relations...
Extracting Information from Multiplex Networks
Multiplex networks are generalized network structures that are able to describe networks in which the same set of nodes are connected by links that have different connotations. Multiplex networks are ubiquitous since they describe social, financial, engineering, and biological networks as well. Extending our ability to analyze complex networks to multiplex network structures increases greatly t...
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, suitable for linguistic and language technology research and development, has attracted significant research interest recently. Several general purpose approaches for removing boilerplate have been presented in the lit...
Extracting hidden information from knowledge networks.
We develop a method allowing us to reconstruct individual tastes of customers from a sparsely connected network of their opinions on products, services, or each other. Two distinct phase transitions occur as the density of edges in this network is increased: Above the first, macroscopic prediction of tastes becomes possible; while above the second, all unknown opinions can be uniquely reconstru...
Journal
Journal title: ACM Transactions on the Web
Year: 2012
ISSN: 1559-1131,1559-114X
DOI: 10.1145/2344416.2344418